Abstract: In the internet era, a vast amount of data is available and more is generated continuously. For many purposes, identifying the area (domain) to which a piece of text belongs is crucial, because it enables data mining tools to handle the text better during information extraction. In this project, we aim to provide that preliminary meta-information about a given piece of text. In the virtual world, such automation is realized through the development of efficient algorithms, and part of it depends on enabling machines to perform tasks that humans do naturally. Domain identification is one such task. In this paper, we highlight the efficient use of Natural Language Processing (NLP) techniques to identify the domain of a given piece of text.
Keywords: URL, keyword matching, domain, database storage.